Statistical Methods in Measuring Search Engine Performance
نویسنده
چکیده
There is a growing need for a set of benchmarking tests for measuring search engines, because research on search engine evaluation is inconsistent in methods and approaches. Using these tests researchers can make comparisons between different search engines and carry out longitudinal studies on a particular search engine with a degree of consistency. Several problems emerge during the evaluations of search engines which are needed to be solved to measure search engine performance objectively. This paper presents a brief review of the different statistical methods and approaches utilized in search engine evaluations. It describes six significant research projects with their fulfilled activities and statistical findings. It also serves as a guideline for choosing an appropriate statistical method for use in search engine evaluations. 1. General problems related to search engine evaluations Before discussing issues connected to search engine evaluation, a clear definition of search engine will be given. The term search engine refers to a general class of programs that enable users to search for documents on the World Wide Web. These programs index documents, then attempt to match documents relevant to a user’s search requests. In evaluations researchers measure the quality of Web search services which can be determined on the basis of several measures. However, it is extremely difficult to find reliable measures that reflect this quality appropriately. Oppenheim [8] suggested that benchmarking tests should include the following measures at the very minimum: 1. precision; 2. relative recall; 3. speed of response; 4. consistency of results over an extended period; 5. proportion of dead or out of date links; 6. proportion of duplicate hits; 7. overall quality of results as rated by users; 8. evaluation of GUI for user friendliness; 9. helpfulness of help and variations of software for new and experienced users; 10. options for display of results; 11. presence of adverts; 12. coverage; 13. estimated/expected search length; 14. length and readability of abstracts; 15. search engine effectiveness. 2000 Mathematics Subject Classification. 62P30, 68M20.
منابع مشابه
Optimizing question answering systems by Accelerated Particle Swarm Optimization (APSO)
One of the most important research areas in natural language processing is Question Answering Systems (QASs). Existing search engines, with Google at the top, have many remarkable capabilities. But there is a basic limitation (search engines do not have deduction capability), a capability which a QAS is expected to have. In this perspective, a search engine may be viewed as a semi-mechanized QA...
متن کاملMethods for Measuring Search Engine Performance over Time
This study introduces methods for evaluating search engine performance over a time period. Several measures are defined, which as a whole, describe search engine functionality over time. The use of these measures is illustrated though a specific example.
متن کاملA New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...
متن کاملReview of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...
متن کاملMeasuring the Effectiveness of Public Search Engines on Thai Queries
This paper compares the retrieval performance of seven public search engines on Thai queries by using a blind evaluation. Two of the engines used in this study are Thaifocused search services, while the remaining engines are large, commercial search engines that have wider collections of web data and support several languages. The results are compared by Mean average precision (MAP) and Mean re...
متن کامل